A Character-based Indexing and Word-based Ranking Method for Japanese Text Retrieval

نویسندگان

  • Toshikazu Fukushima
  • Susumu Akamine
چکیده

7KLV SDSHU GHVFULEHV D -DSDQHVH WH[W UHWULHYDO V\VWHP WKDW ZH DSSOLHG WR WKH -DSDQHVH DG KRF ,5 WDVN LQ WKH 17&,5 :RUNVKRS $ FKDUDFWHU EDVHG LQGH[LQJ DQG ZRUG EDVHG UDQNLQJ PHWKRG ZDV LPSOHPHQWHG RQ WKLV V\VWHP 7KH V\VWHP JHQHUDWHV DQ LQGH[ IRU WLWOH! DEVWUDFW! DQG NH\ZRUG! SDUWV LQ GRFXPHQWV ,W SDUVHV RQO\ GHVFULSWLRQ! SDUWV LQ VHDUFK WRSLFV DV TXHULHV ,WV UDQNLQJ VWUDWHJ\ LV YHU\ VLPSOH ,W XVHV WKH YHFWRU VSDFH EDVHG RQ VKRUW XQLWV RI -DSDQHVH ZRUGV ,W GHOHWHV VWRS ZRUGV LQ D TXHU\ DQG FDOFXODWHV WKH 7) ,') VFRUH IRU HDFK GRFXPHQW ,WV DYHUDJH SUHFLVLRQ VFRUH IRU WKH WUDLQLQJ VHW RI VHDUFK WRSLFV LV ([SHULPHQWDO UHVXOWV VKRZ WKH HIIHFWLYHQHVV RI XVLQJ WKH VKRUW XQLWV RI ZRUGV DQG GHOHWLQJ VWRS ZRUGV LQ D TXHU\ .H\ZRUGV

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparing between the impacts of text based indexing and folksonomy on ranking of images search via Google search engine

Background and Aim: The purpose of this study was to compare the impact of text based indexing and folksonomy in image retrieval via Google search engine. Methods: This study used experimental method. The sample is 30 images extracted from the book “Gray anatomy”. The research was carried out in 4 stages; in the first stage, images were uploaded to an “Instagram” account so the images are tagge...

متن کامل

Berkeley at NTCIR-2: Chinese, Japanese, and English IR experiments

This paper reports on the work of Berkeley group at the second NTCIR workshop on Japanese & English IR and Chinese IR. A number of runs were submitted on all subtasks in the two main tasks. Our main focus on the Japanese monolingual subtask was on comparing the retrieval effectiveness of different segmentation methods. The experimental results show the bigram indexing outperformed the word-base...

متن کامل

RICOH at NTCIR-2

At NTCIR-2, RICOH submitted eight runs for the Japanese IR task. Of the eight runs, four runs use the title eld only and the other four use the description eld only. RICOH's system is built on our English text retrieval system and augmented to handle Japanese text. The system features (1) hybrid retrieval using a combination of n-gram indexing and wordbased document ranking; (2) word-based and ...

متن کامل

Thomson Legal and Regulatory at NTCIR-3: Japanese, Chinese and English Retrieval Experiments

Thomson Legal and Regulatory participated in the CLIR task of the NTCIR-3 workshop. We submitted formal runs for monolingual retrieval in Japanese and Chinese, and for bilingual retrieval from English to Japanese. Our main focus was in Japanese retrieval. We compared word-based and character-based indexing, as well as query formulation using characters and character bigrams. Our results show th...

متن کامل

Okapi Chinese Text Retrieval Experiments at TREC-6

The focus of the Okapi TREC{6 Chinese experiments is on investigating the e ectiveness of di erent automatic indexing methods and phrase weighting for retrieval based on probabilistic models over Chinese text. We compare di erent probabilistic weighting methods based on a range of word and single character approaches. There are two indexing methods used in our experiments. One indexing method i...

متن کامل

A New Indexing and Text Ranking Method for Japanese Text Databases Using Simple-Word Compounds as Keywords

This paper describes a new indexing method for Japanese text databases using the simpie keyword string. A compound word is treated as a string of simple words, which are the smallest units in Japanese grammar which still maintain their meanings. As a result, retrieved texts can be ranked, according to the similarity of their meaning and the query, without using a control vocabulary or thesaurus...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999